Semantically Conceptualizing and Annotating Tables

Authors

  • Stephen Lynn
  • David W. Embley
Abstract

Enabling a system to automatically conceptualize and annotate a human-readable table is one way to create interesting semantic-web content. But exactly “how?” is not clear. With conceptualization and annotation in mind, we investigate a semantic-enrichment procedure as a way to turn syntactically observed table layout into semantically coherent ontological concepts, relationships, and constraints. Our semantic-enrichment procedure shows how to make use of auxiliary world knowledge to construct rich ontological structures and to populate these ontological structures with instance data. The system uses auxiliary knowledge (1) to recognize concepts and which data values belong to which concepts, (2) to discover relationships among concepts and which data-value combinations represent relationship instances, and (3) to discover constraints over the concepts and relationships that the data values and data-value combinations should satisfy. Experimental evaluations indicate that the automatic conceptualization and annotation processes perform well, yielding F-measures of 90% for concept recognition, 77% for relationship discovery, and 90% for constraint discovery in web tables selected from the geopolitical domain.
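For readers unfamiliar with the metric reported above, the following is a minimal sketch of how an F-measure is computed from counts of correct and incorrect predictions. It assumes the balanced F-measure (F1, the harmonic mean of precision and recall); the abstract does not state which variant the authors used. The function name `f_measure` and the counts in the example are hypothetical illustrations, not data from the paper.

```python
def f_measure(true_positives: int, false_positives: int, false_negatives: int) -> float:
    """Balanced F-measure (F1): harmonic mean of precision and recall.

    Assumption: the balanced variant is used; the abstract does not specify.
    """
    predicted = true_positives + false_positives  # items the system reported
    actual = true_positives + false_negatives     # items in the gold standard
    if predicted == 0 or actual == 0:
        return 0.0
    precision = true_positives / predicted
    recall = true_positives / actual
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts for illustration only (not results from the paper):
# 9 concepts recognized correctly, 1 spurious, 1 missed.
score = f_measure(true_positives=9, false_positives=1, false_negatives=1)
print(f"{score:.2f}")
```

Because the harmonic mean is pulled toward the lower of its two inputs, a system cannot reach a high F-measure by maximizing precision alone or recall alone.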

Similar resources

A Tool for Creating and Visualizing Semantic Annotations on Relational Tables

Semantically annotating content from relational tables on the Web is a crucial task towards realizing the vision of the Semantic Web. However, there is a lack of open source, user-friendly tools to facilitate this. This paper describes an extension of the TableMiner system, an open source Semantic Table Interpretation system that automatically annotates Web tables using Linked Data in an effect...

An Annotation Framework for the Semantic Web

Creating metadata by annotating documents is one of the major techniques for putting machine-understandable data on the Web. Though there exist many tools for annotating web pages, few of them fully support the creation of semantically interlinked metadata, such as is necessary for a truly Semantic Web. In this paper, we present an ontology-based annotation environment, OntoAnnotate, which offers...

Creating Metadata for the Semantic Web — An Annotation Environment and the Human Factor

Creating metadata by annotating documents is one of the major techniques for putting machine-understandable data on the Web. Though there exist many tools for annotating web pages, few of them fully support the creation of semantically interlinked metadata, such as is necessary for a truly Semantic Web. In this paper, we present an ontology-based annotation environment, OntoAnnotate, which offers...

EQuIKa System: Supporting OWL applications with local closed world assumption

One of the major advantages of semantically annotating resources on the Web is the facilitation of web services discovery. Languages based on OWL are prone to several problems for web services discovery due to the open-world assumption when handling incomplete information. Thus standard OWL reasoners are usually not suitable for discovery purposes. The aforementioned problems can easily be fixed...

Two-staged Approach for Semantically Annotating and Brokering Tv-related Services

Nowadays, more and more distributed digital TV and TV-related resources are published on the Web, such as Electronic Personal TV Guide (EPG) data. To enable applications to access these resources easily, the TV resource data is commonly provided by Web service technologies. The huge variety of data related to the TV domain and the wide range of services that provide it raise the need to have ...

Publication date: 2008